Overview
Brought to you by YData
Dataset statistics
| Number of variables | 19 |
|---|---|
| Number of observations | 2154048 |
| Missing cells | 14380032 |
| Missing cells (%) | 35.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 920.3 MiB |
| Average record size in memory | 448.0 B |
Variable types
| Text | 2 |
|---|---|
| Categorical | 3 |
| Numeric | 13 |
| Boolean | 1 |
MRG has constant value "False" | Constant |
ARPU_SEGMENT is highly overall correlated with FREQUENCE and 7 other fields | High correlation |
CHURN is highly overall correlated with REGULARITY | High correlation |
FREQUENCE is highly overall correlated with ARPU_SEGMENT and 6 other fields | High correlation |
FREQUENCE_RECH is highly overall correlated with ARPU_SEGMENT and 6 other fields | High correlation |
FREQ_TOP_PACK is highly overall correlated with ARPU_SEGMENT and 6 other fields | High correlation |
MONTANT is highly overall correlated with ARPU_SEGMENT and 7 other fields | High correlation |
ON_NET is highly overall correlated with ARPU_SEGMENT and 4 other fields | High correlation |
ORANGE is highly overall correlated with ARPU_SEGMENT and 6 other fields | High correlation |
REGULARITY is highly overall correlated with ARPU_SEGMENT and 7 other fields | High correlation |
REVENUE is highly overall correlated with ARPU_SEGMENT and 7 other fields | High correlation |
TENURE is highly imbalanced (86.4%) | Imbalance |
REGION has 849299 (39.4%) missing values | Missing |
MONTANT has 756739 (35.1%) missing values | Missing |
FREQUENCE_RECH has 756739 (35.1%) missing values | Missing |
REVENUE has 726048 (33.7%) missing values | Missing |
ARPU_SEGMENT has 726048 (33.7%) missing values | Missing |
FREQUENCE has 726048 (33.7%) missing values | Missing |
DATA_VOLUME has 1060433 (49.2%) missing values | Missing |
ON_NET has 786675 (36.5%) missing values | Missing |
ORANGE has 895248 (41.6%) missing values | Missing |
TIGO has 1290016 (59.9%) missing values | Missing |
ZONE1 has 1984327 (92.1%) missing values | Missing |
ZONE2 has 2017224 (93.6%) missing values | Missing |
TOP_PACK has 902594 (41.9%) missing values | Missing |
FREQ_TOP_PACK has 902594 (41.9%) missing values | Missing |
DATA_VOLUME is highly skewed (γ1 = 36.25674263) | Skewed |
ZONE1 is highly skewed (γ1 = 25.70889323) | Skewed |
ZONE2 is highly skewed (γ1 = 30.88518917) | Skewed |
user_id has unique values | Unique |
DATA_VOLUME has 320153 (14.9%) zeros | Zeros |
ON_NET has 108046 (5.0%) zeros | Zeros |
ORANGE has 61623 (2.9%) zeros | Zeros |
TIGO has 94270 (4.4%) zeros | Zeros |
ZONE1 has 59935 (2.8%) zeros | Zeros |
ZONE2 has 40440 (1.9%) zeros | Zeros |
Reproduction
| Analysis started | 2025-07-09 19:17:30.676501 |
|---|---|
| Analysis finished | 2025-07-09 19:19:39.275438 |
| Duration | 2 minutes and 8.6 seconds |
| Software version | ydata-profiling vv4.16.1 |
| Download configuration | config.json |
Variables
user_id
Text
Unique 
| Distinct | 2154048 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 199.3 MiB |
Length
| Max length | 40 |
|---|---|
| Median length | 40 |
| Mean length | 40 |
| Min length | 40 |
Unique
| Unique | 2154048 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 00000bfd7d50f01092811bc0c8d7b0d6fe7c3596 |
|---|---|
| 2nd row | 00000cb4a5d760de88fecb38e2f71b7bec52e834 |
| 3rd row | 00001654a9d9f96303d9969d0a4a851714a4bb57 |
| 4th row | 00001dd6fa45f7ba044bd5d84937be464ce78ac2 |
| 5th row | 000028d9e13a595abe061f9b58f3d76ab907850f |
| Value | Count | Frequency (%) |
| ffff56138e6bf8e553514dfb97ee7cbe0f6cc609 | 1 | < 0.1% |
| ffff5956b3770fca0c1f5a1c907d74ff603e8ff9 | 1 | < 0.1% |
| ffff6410f18958f9558a229475cfc54bc2b158ef | 1 | < 0.1% |
| ffff6e41acb8a069e888c4e8fbd9779f1e0bde73 | 1 | < 0.1% |
| ffff8da611b1f7591fae91245f93a6dcf276056a | 1 | < 0.1% |
| ffffa921a44c8611cf12a5a0277c4238d4a63749 | 1 | < 0.1% |
| ffffb2b8b63959b8a374e2a2ccaf2b9e521879ad | 1 | < 0.1% |
| ffffc38e1c3cb77a88941e739c358fd96bce3238 | 1 | < 0.1% |
| ffffccdae4d9097c20f95e87f5c89845cab4eff3 | 1 | < 0.1% |
| ffffd1d48dd02c059c82c70b8793c8dfa3d09593 | 1 | < 0.1% |
| Other values (2154038) | 2154038 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 5393160 | 6.3% |
| b | 5389543 | 6.3% |
| c | 5387121 | 6.3% |
| 0 | 5387072 | 6.3% |
| 9 | 5385666 | 6.3% |
| d | 5385205 | 6.3% |
| a | 5384709 | 6.2% |
| 5 | 5384389 | 6.2% |
| 8 | 5384268 | 6.2% |
| 4 | 5383987 | 6.2% |
| Other values (6) | 32296800 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 86161920 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2 | 5393160 | 6.3% |
| b | 5389543 | 6.3% |
| c | 5387121 | 6.3% |
| 0 | 5387072 | 6.3% |
| 9 | 5385666 | 6.3% |
| d | 5385205 | 6.3% |
| a | 5384709 | 6.2% |
| 5 | 5384389 | 6.2% |
| 8 | 5384268 | 6.2% |
| 4 | 5383987 | 6.2% |
| Other values (6) | 32296800 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 86161920 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2 | 5393160 | 6.3% |
| b | 5389543 | 6.3% |
| c | 5387121 | 6.3% |
| 0 | 5387072 | 6.3% |
| 9 | 5385666 | 6.3% |
| d | 5385205 | 6.3% |
| a | 5384709 | 6.2% |
| 5 | 5384389 | 6.2% |
| 8 | 5384268 | 6.2% |
| 4 | 5383987 | 6.2% |
| Other values (6) | 32296800 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 86161920 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2 | 5393160 | 6.3% |
| b | 5389543 | 6.3% |
| c | 5387121 | 6.3% |
| 0 | 5387072 | 6.3% |
| 9 | 5385666 | 6.3% |
| d | 5385205 | 6.3% |
| a | 5384709 | 6.2% |
| 5 | 5384389 | 6.2% |
| 8 | 5384268 | 6.2% |
| 4 | 5383987 | 6.2% |
| Other values (6) | 32296800 |
REGION
Categorical
Missing 
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 849299 |
| Missing (%) | 39.4% |
| Memory size | 130.6 MiB |
| DAKAR | |
|---|---|
| THIES | |
| SAINT-LOUIS | |
| LOUGA | |
| KAOLACK | |
| Other values (9) |
Length
| Max length | 11 |
|---|---|
| Median length | 5 |
| Mean length | 6.3267073 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | FATICK |
|---|---|
| 2nd row | DAKAR |
| 3rd row | DAKAR |
| 4th row | LOUGA |
| 5th row | LOUGA |
Common Values
| Value | Count | Frequency (%) |
| DAKAR | 513271 | |
| THIES | 180052 | 8.4% |
| SAINT-LOUIS | 119886 | 5.6% |
| LOUGA | 99053 | 4.6% |
| KAOLACK | 96986 | 4.5% |
| DIOURBEL | 66911 | 3.1% |
| TAMBACOUNDA | 55074 | 2.6% |
| KAFFRINE | 43963 | 2.0% |
| KOLDA | 38743 | 1.8% |
| FATICK | 35643 | 1.7% |
| Other values (4) | 55167 | 2.6% |
| (Missing) | 849299 |
Length
| Value | Count | Frequency (%) |
| dakar | 513271 | |
| thies | 180052 | 13.8% |
| saint-louis | 119886 | 9.2% |
| louga | 99053 | 7.6% |
| kaolack | 96986 | 7.4% |
| diourbel | 66911 | 5.1% |
| tambacounda | 55074 | 4.2% |
| kaffrine | 43963 | 3.4% |
| kolda | 38743 | 3.0% |
| fatick | 35643 | 2.7% |
| Other values (4) | 55167 | 4.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 1781190 | |
| K | 826612 | |
| D | 678138 | 8.2% |
| R | 646090 | 7.8% |
| I | 613350 | 7.4% |
| O | 503757 | 6.1% |
| S | 422943 | 5.1% |
| L | 421579 | 5.1% |
| T | 419738 | 5.1% |
| U | 368028 | 4.5% |
| Other values (10) | 1573340 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 8254765 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| A | 1781190 | |
| K | 826612 | |
| D | 678138 | 8.2% |
| R | 646090 | 7.8% |
| I | 613350 | 7.4% |
| O | 503757 | 6.1% |
| S | 422943 | 5.1% |
| L | 421579 | 5.1% |
| T | 419738 | 5.1% |
| U | 368028 | 4.5% |
| Other values (10) | 1573340 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 8254765 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| A | 1781190 | |
| K | 826612 | |
| D | 678138 | 8.2% |
| R | 646090 | 7.8% |
| I | 613350 | 7.4% |
| O | 503757 | 6.1% |
| S | 422943 | 5.1% |
| L | 421579 | 5.1% |
| T | 419738 | 5.1% |
| U | 368028 | 4.5% |
| Other values (10) | 1573340 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 8254765 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| A | 1781190 | |
| K | 826612 | |
| D | 678138 | 8.2% |
| R | 646090 | 7.8% |
| I | 613350 | 7.4% |
| O | 503757 | 6.1% |
| S | 422943 | 5.1% |
| L | 421579 | 5.1% |
| T | 419738 | 5.1% |
| U | 368028 | 4.5% |
| Other values (10) | 1573340 |
TENURE
Categorical
Imbalance 
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 141.8 MiB |
| K > 24 month | |
|---|---|
| I 18-21 month | 45278 |
| H 15-18 month | 26006 |
| G 12-15 month | 14901 |
| J 21-24 month | 12725 |
| Other values (3) | 11937 |
Length
| Max length | 13 |
|---|---|
| Median length | 12 |
| Mean length | 12.044707 |
| Min length | 11 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | K > 24 month |
|---|---|
| 2nd row | I 18-21 month |
| 3rd row | K > 24 month |
| 4th row | K > 24 month |
| 5th row | K > 24 month |
Common Values
| Value | Count | Frequency (%) |
| K > 24 month | 2043201 | |
| I 18-21 month | 45278 | 2.1% |
| H 15-18 month | 26006 | 1.2% |
| G 12-15 month | 14901 | 0.7% |
| J 21-24 month | 12725 | 0.6% |
| F 9-12 month | 9328 | 0.4% |
| E 6-9 month | 1839 | 0.1% |
| D 3-6 month | 770 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| month | 2154048 | |
| k | 2043201 | |
| 2043201 | ||
| 24 | 2043201 | |
| i | 45278 | 0.5% |
| 18-21 | 45278 | 0.5% |
| h | 26006 | 0.3% |
| 15-18 | 26006 | 0.3% |
| g | 14901 | 0.2% |
| 12-15 | 14901 | 0.2% |
| Other values (8) | 49324 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 6351297 | ||
| n | 2154048 | 8.3% |
| o | 2154048 | 8.3% |
| m | 2154048 | 8.3% |
| h | 2154048 | 8.3% |
| t | 2154048 | 8.3% |
| 2 | 2138158 | 8.2% |
| 4 | 2055926 | 7.9% |
| K | 2043201 | 7.9% |
| > | 2043201 | 7.9% |
| Other values (14) | 542854 | 2.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 25944877 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 6351297 | ||
| n | 2154048 | 8.3% |
| o | 2154048 | 8.3% |
| m | 2154048 | 8.3% |
| h | 2154048 | 8.3% |
| t | 2154048 | 8.3% |
| 2 | 2138158 | 8.2% |
| 4 | 2055926 | 7.9% |
| K | 2043201 | 7.9% |
| > | 2043201 | 7.9% |
| Other values (14) | 542854 | 2.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 25944877 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 6351297 | ||
| n | 2154048 | 8.3% |
| o | 2154048 | 8.3% |
| m | 2154048 | 8.3% |
| h | 2154048 | 8.3% |
| t | 2154048 | 8.3% |
| 2 | 2138158 | 8.2% |
| 4 | 2055926 | 7.9% |
| K | 2043201 | 7.9% |
| > | 2043201 | 7.9% |
| Other values (14) | 542854 | 2.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 25944877 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 6351297 | ||
| n | 2154048 | 8.3% |
| o | 2154048 | 8.3% |
| m | 2154048 | 8.3% |
| h | 2154048 | 8.3% |
| t | 2154048 | 8.3% |
| 2 | 2138158 | 8.2% |
| 4 | 2055926 | 7.9% |
| K | 2043201 | 7.9% |
| > | 2043201 | 7.9% |
| Other values (14) | 542854 | 2.1% |
MONTANT
Real number (ℝ)
High correlation  Missing 
| Distinct | 6540 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 756739 |
| Missing (%) | 35.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5532.117 |
| Minimum | 10 |
|---|---|
| Maximum | 470000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.4 MiB |
Quantile statistics
| Minimum | 10 |
|---|---|
| 5-th percentile | 250 |
| Q1 | 1000 |
| median | 3000 |
| Q3 | 7350 |
| 95-th percentile | 18500 |
| Maximum | 470000 |
| Range | 469990 |
| Interquartile range (IQR) | 6350 |
Descriptive statistics
| Standard deviation | 7111.3394 |
|---|---|
| Coefficient of variation (CV) | 1.2854644 |
| Kurtosis | 57.528484 |
| Mean | 5532.117 |
| Median Absolute Deviation (MAD) | 2400 |
| Skewness | 4.2297262 |
| Sum | 7.7300769 × 109 |
| Variance | 50571148 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 500 | 112976 | 5.2% |
| 1000 | 82997 | 3.9% |
| 1500 | 48710 | 2.3% |
| 2000 | 46122 | 2.1% |
| 200 | 40004 | 1.9% |
| 3000 | 34831 | 1.6% |
| 2500 | 32026 | 1.5% |
| 4000 | 24109 | 1.1% |
| 3500 | 23793 | 1.1% |
| 100 | 20188 | 0.9% |
| Other values (6530) | 931553 | |
| (Missing) | 756739 |
| Value | Count | Frequency (%) |
| 10 | 3 | < 0.1% |
| 20 | 2 | < 0.1% |
| 22 | 1 | < 0.1% |
| 25 | 2 | < 0.1% |
| 30 | 2 | < 0.1% |
| 35 | 1 | < 0.1% |
| 37 | 1 | < 0.1% |
| 40 | 1 | < 0.1% |
| 48 | 1 | < 0.1% |
| 50 | 360 |
| Value | Count | Frequency (%) |
| 470000 | 1 | |
| 290500 | 1 | |
| 286500 | 1 | |
| 265000 | 1 | |
| 259500 | 1 | |
| 256000 | 1 | |
| 235500 | 1 | |
| 235000 | 1 | |
| 231000 | 2 | |
| 230600 | 1 |
FREQUENCE_RECH
Real number (ℝ)
High correlation  Missing 
| Distinct | 123 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 756739 |
| Missing (%) | 35.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.52912 |
| Minimum | 1 |
|---|---|
| Maximum | 133 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 7 |
| Q3 | 16 |
| 95-th percentile | 40 |
| Maximum | 133 |
| Range | 132 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 13.27407 |
|---|---|
| Coefficient of variation (CV) | 1.1513515 |
| Kurtosis | 5.3169562 |
| Mean | 11.52912 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 2.1119879 |
| Sum | 16109743 |
| Variance | 176.20092 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 219471 | 10.2% |
| 2 | 139897 | 6.5% |
| 3 | 110506 | 5.1% |
| 4 | 88889 | 4.1% |
| 5 | 74527 | 3.5% |
| 6 | 64115 | 3.0% |
| 7 | 55616 | 2.6% |
| 8 | 49983 | 2.3% |
| 9 | 44715 | 2.1% |
| 10 | 40655 | 1.9% |
| Other values (113) | 508935 | |
| (Missing) | 756739 |
| Value | Count | Frequency (%) |
| 1 | 219471 | |
| 2 | 139897 | |
| 3 | 110506 | |
| 4 | 88889 | |
| 5 | 74527 | 3.5% |
| 6 | 64115 | 3.0% |
| 7 | 55616 | 2.6% |
| 8 | 49983 | 2.3% |
| 9 | 44715 | 2.1% |
| 10 | 40655 | 1.9% |
| Value | Count | Frequency (%) |
| 133 | 1 | < 0.1% |
| 132 | 1 | < 0.1% |
| 131 | 1 | < 0.1% |
| 122 | 1 | < 0.1% |
| 121 | 1 | < 0.1% |
| 119 | 1 | < 0.1% |
| 118 | 1 | < 0.1% |
| 117 | 2 | |
| 115 | 4 | |
| 114 | 2 |
REVENUE
Real number (ℝ)
High correlation  Missing 
| Distinct | 38114 |
|---|---|
| Distinct (%) | 2.7% |
| Missing | 726048 |
| Missing (%) | 33.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5510.8103 |
| Minimum | 1 |
|---|---|
| Maximum | 532177 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 199 |
| Q1 | 1000 |
| median | 3000 |
| Q3 | 7368 |
| 95-th percentile | 18791 |
| Maximum | 532177 |
| Range | 532176 |
| Interquartile range (IQR) | 6368 |
Descriptive statistics
| Standard deviation | 7187.1129 |
|---|---|
| Coefficient of variation (CV) | 1.3041844 |
| Kurtosis | 64.821825 |
| Mean | 5510.8103 |
| Median Absolute Deviation (MAD) | 2498 |
| Skewness | 4.1890021 |
| Sum | 7.8694372 × 109 |
| Variance | 51654592 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 500 | 58783 | 2.7% |
| 1000 | 36269 | 1.7% |
| 1500 | 20740 | 1.0% |
| 200 | 20043 | 0.9% |
| 2000 | 18220 | 0.8% |
| 3000 | 13211 | 0.6% |
| 2500 | 12096 | 0.6% |
| 3500 | 8727 | 0.4% |
| 4000 | 8303 | 0.4% |
| 100 | 7893 | 0.4% |
| Other values (38104) | 1223715 | |
| (Missing) | 726048 |
| Value | Count | Frequency (%) |
| 1 | 4295 | |
| 2 | 3134 | |
| 3 | 211 | < 0.1% |
| 4 | 1961 | |
| 5 | 104 | < 0.1% |
| 6 | 1111 | 0.1% |
| 7 | 522 | < 0.1% |
| 8 | 1225 | 0.1% |
| 9 | 1230 | 0.1% |
| 10 | 2691 |
| Value | Count | Frequency (%) |
| 532177 | 1 | |
| 397968 | 1 | |
| 323541 | 1 | |
| 272191 | 1 | |
| 266050 | 1 | |
| 244001 | 1 | |
| 240094 | 1 | |
| 233583 | 1 | |
| 233413 | 1 | |
| 233141 | 1 |
ARPU_SEGMENT
Real number (ℝ)
High correlation  Missing 
| Distinct | 16535 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 726048 |
| Missing (%) | 33.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1836.9429 |
| Minimum | 0 |
|---|---|
| Maximum | 177392 |
| Zeros | 4295 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 66 |
| Q1 | 333 |
| median | 1000 |
| Q3 | 2456 |
| 95-th percentile | 6264 |
| Maximum | 177392 |
| Range | 177392 |
| Interquartile range (IQR) | 2123 |
Descriptive statistics
| Standard deviation | 2395.7 |
|---|---|
| Coefficient of variation (CV) | 1.3041777 |
| Kurtosis | 64.822078 |
| Mean | 1836.9429 |
| Median Absolute Deviation (MAD) | 833 |
| Skewness | 4.1890192 |
| Sum | 2.6231545 × 109 |
| Variance | 5739378.3 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 167 | 67878 | 3.2% |
| 333 | 43705 | 2.0% |
| 500 | 28568 | 1.3% |
| 667 | 22898 | 1.1% |
| 67 | 22753 | 1.1% |
| 1000 | 18483 | 0.9% |
| 833 | 15341 | 0.7% |
| 1167 | 11524 | 0.5% |
| 1333 | 10725 | 0.5% |
| 33 | 10473 | 0.5% |
| Other values (16525) | 1175652 | |
| (Missing) | 726048 |
| Value | Count | Frequency (%) |
| 0 | 4295 | |
| 1 | 5306 | |
| 2 | 1737 | 0.1% |
| 3 | 5146 | |
| 4 | 2755 | |
| 5 | 1819 | 0.1% |
| 6 | 1175 | 0.1% |
| 7 | 3439 | |
| 8 | 728 | < 0.1% |
| 9 | 1090 | 0.1% |
| Value | Count | Frequency (%) |
| 177392 | 1 | |
| 132656 | 1 | |
| 107847 | 1 | |
| 90730 | 1 | |
| 88683 | 1 | |
| 81334 | 1 | |
| 80031 | 1 | |
| 77861 | 1 | |
| 77804 | 1 | |
| 77714 | 1 |
FREQUENCE
Real number (ℝ)
High correlation  Missing 
| Distinct | 91 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 726048 |
| Missing (%) | 33.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.978141 |
| Minimum | 1 |
|---|---|
| Maximum | 91 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 9 |
| Q3 | 20 |
| 95-th percentile | 45 |
| Maximum | 91 |
| Range | 90 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 14.694035 |
|---|---|
| Coefficient of variation (CV) | 1.0512152 |
| Kurtosis | 3.402515 |
| Mean | 13.978141 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 1.7750807 |
| Sum | 19960786 |
| Variance | 215.91466 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 161585 | 7.5% |
| 2 | 116460 | 5.4% |
| 3 | 95237 | 4.4% |
| 4 | 82338 | 3.8% |
| 5 | 71867 | 3.3% |
| 6 | 64228 | 3.0% |
| 7 | 57343 | 2.7% |
| 8 | 51893 | 2.4% |
| 9 | 47532 | 2.2% |
| 10 | 43694 | 2.0% |
| Other values (81) | 635823 | |
| (Missing) | 726048 |
| Value | Count | Frequency (%) |
| 1 | 161585 | |
| 2 | 116460 | |
| 3 | 95237 | |
| 4 | 82338 | |
| 5 | 71867 | |
| 6 | 64228 | 3.0% |
| 7 | 57343 | 2.7% |
| 8 | 51893 | 2.4% |
| 9 | 47532 | 2.2% |
| 10 | 43694 | 2.0% |
| Value | Count | Frequency (%) |
| 91 | 83 | < 0.1% |
| 90 | 84 | < 0.1% |
| 89 | 126 | < 0.1% |
| 88 | 165 | < 0.1% |
| 87 | 214 | |
| 86 | 291 | |
| 85 | 268 | |
| 84 | 322 | |
| 83 | 362 | |
| 82 | 481 |
DATA_VOLUME
Real number (ℝ)
Missing  Skewed  Zeros 
| Distinct | 41550 |
|---|---|
| Distinct (%) | 3.8% |
| Missing | 1060433 |
| Missing (%) | 49.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3366.4502 |
| Minimum | 0 |
|---|---|
| Maximum | 1823866 |
| Zeros | 320153 |
| Zeros (%) | 14.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 257 |
| Q3 | 2895 |
| 95-th percentile | 14981 |
| Maximum | 1823866 |
| Range | 1823866 |
| Interquartile range (IQR) | 2895 |
Descriptive statistics
| Standard deviation | 13304.464 |
|---|---|
| Coefficient of variation (CV) | 3.952075 |
| Kurtosis | 2448.1241 |
| Mean | 3366.4502 |
| Median Absolute Deviation (MAD) | 257 |
| Skewness | 36.256743 |
| Sum | 3.6816004 × 109 |
| Variance | 1.7700875 × 108 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 320153 | 14.9% |
| 1 | 41366 | 1.9% |
| 2 | 13233 | 0.6% |
| 3 | 7326 | 0.3% |
| 4 | 5613 | 0.3% |
| 1024 | 5469 | 0.3% |
| 5 | 4678 | 0.2% |
| 1023 | 3794 | 0.2% |
| 6 | 3778 | 0.2% |
| 7 | 3202 | 0.1% |
| Other values (41540) | 685003 | |
| (Missing) | 1060433 |
| Value | Count | Frequency (%) |
| 0 | 320153 | |
| 1 | 41366 | 1.9% |
| 2 | 13233 | 0.6% |
| 3 | 7326 | 0.3% |
| 4 | 5613 | 0.3% |
| 5 | 4678 | 0.2% |
| 6 | 3778 | 0.2% |
| 7 | 3202 | 0.1% |
| 8 | 2920 | 0.1% |
| 9 | 2837 | 0.1% |
| Value | Count | Frequency (%) |
| 1823866 | 1 | |
| 1702309 | 1 | |
| 1556829 | 1 | |
| 1352304 | 1 | |
| 1326875 | 1 | |
| 1297464 | 1 | |
| 1272720 | 1 | |
| 1238915 | 1 | |
| 1154809 | 1 | |
| 1117735 | 1 |
ON_NET
Real number (ℝ)
High correlation  Missing  Zeros 
| Distinct | 9884 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 786675 |
| Missing (%) | 36.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 277.68914 |
| Minimum | 0 |
|---|---|
| Maximum | 50809 |
| Zeros | 108046 |
| Zeros (%) | 5.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 5 |
| median | 27 |
| Q3 | 156 |
| 95-th percentile | 1358 |
| Maximum | 50809 |
| Range | 50809 |
| Interquartile range (IQR) | 151 |
Descriptive statistics
| Standard deviation | 872.68891 |
|---|---|
| Coefficient of variation (CV) | 3.1426829 |
| Kurtosis | 116.85712 |
| Mean | 277.68914 |
| Median Absolute Deviation (MAD) | 26 |
| Skewness | 8.1479278 |
| Sum | 3.7970463 × 108 |
| Variance | 761585.93 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 108046 | 5.0% |
| 1 | 92118 | 4.3% |
| 2 | 58773 | 2.7% |
| 3 | 42296 | 2.0% |
| 7 | 41382 | 1.9% |
| 4 | 38699 | 1.8% |
| 8 | 38501 | 1.8% |
| 5 | 29845 | 1.4% |
| 6 | 29496 | 1.4% |
| 9 | 19640 | 0.9% |
| Other values (9874) | 868577 | |
| (Missing) | 786675 |
| Value | Count | Frequency (%) |
| 0 | 108046 | |
| 1 | 92118 | |
| 2 | 58773 | |
| 3 | 42296 | 2.0% |
| 4 | 38699 | 1.8% |
| 5 | 29845 | 1.4% |
| 6 | 29496 | 1.4% |
| 7 | 41382 | 1.9% |
| 8 | 38501 | 1.8% |
| 9 | 19640 | 0.9% |
| Value | Count | Frequency (%) |
| 50809 | 1 | |
| 45011 | 1 | |
| 38648 | 1 | |
| 36687 | 1 | |
| 34105 | 1 | |
| 33452 | 1 | |
| 32141 | 1 | |
| 31768 | 1 | |
| 30425 | 1 | |
| 29861 | 1 |
ORANGE
Real number (ℝ)
High correlation  Missing  Zeros 
| Distinct | 3167 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 895248 |
| Missing (%) | 41.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 95.418711 |
| Minimum | 0 |
|---|---|
| Maximum | 21323 |
| Zeros | 61623 |
| Zeros (%) | 2.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 7 |
| median | 29 |
| Q3 | 99 |
| 95-th percentile | 392 |
| Maximum | 21323 |
| Range | 21323 |
| Interquartile range (IQR) | 92 |
Descriptive statistics
| Standard deviation | 204.98727 |
|---|---|
| Coefficient of variation (CV) | 2.1482921 |
| Kurtosis | 189.03871 |
| Mean | 95.418711 |
| Median Absolute Deviation (MAD) | 27 |
| Skewness | 8.0540159 |
| Sum | 1.2011307 × 108 |
| Variance | 42019.779 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 68881 | 3.2% |
| 0 | 61623 | 2.9% |
| 2 | 49435 | 2.3% |
| 3 | 36198 | 1.7% |
| 4 | 33933 | 1.6% |
| 8 | 25826 | 1.2% |
| 5 | 24649 | 1.1% |
| 6 | 22373 | 1.0% |
| 7 | 21218 | 1.0% |
| 10 | 20250 | 0.9% |
| Other values (3157) | 894414 | |
| (Missing) | 895248 |
| Value | Count | Frequency (%) |
| 0 | 61623 | |
| 1 | 68881 | |
| 2 | 49435 | |
| 3 | 36198 | |
| 4 | 33933 | |
| 5 | 24649 | 1.1% |
| 6 | 22373 | 1.0% |
| 7 | 21218 | 1.0% |
| 8 | 25826 | 1.2% |
| 9 | 19954 | 0.9% |
| Value | Count | Frequency (%) |
| 21323 | 1 | |
| 12040 | 1 | |
| 7660 | 1 | |
| 7314 | 1 | |
| 6788 | 1 | |
| 6721 | 1 | |
| 6555 | 1 | |
| 6429 | 1 | |
| 6416 | 1 | |
| 6319 | 1 |
TIGO
Real number (ℝ)
Missing  Zeros 
| Distinct | 1315 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 1290016 |
| Missing (%) | 59.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 23.109253 |
| Minimum | 0 |
|---|---|
| Maximum | 4174 |
| Zeros | 94270 |
| Zeros (%) | 4.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 6 |
| Q3 | 20 |
| 95-th percentile | 95 |
| Maximum | 4174 |
| Range | 4174 |
| Interquartile range (IQR) | 18 |
Descriptive statistics
| Standard deviation | 63.578086 |
|---|---|
| Coefficient of variation (CV) | 2.7511961 |
| Kurtosis | 334.67472 |
| Mean | 23.109253 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 12.899932 |
| Sum | 19967134 |
| Variance | 4042.173 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 112165 | 5.2% |
| 0 | 94270 | 4.4% |
| 2 | 72530 | 3.4% |
| 3 | 53003 | 2.5% |
| 4 | 42980 | 2.0% |
| 5 | 34524 | 1.6% |
| 6 | 29421 | 1.4% |
| 7 | 26170 | 1.2% |
| 8 | 24304 | 1.1% |
| 9 | 20835 | 1.0% |
| Other values (1305) | 353830 | 16.4% |
| (Missing) | 1290016 |
| Value | Count | Frequency (%) |
| 0 | 94270 | |
| 1 | 112165 | |
| 2 | 72530 | |
| 3 | 53003 | |
| 4 | 42980 | 2.0% |
| 5 | 34524 | 1.6% |
| 6 | 29421 | 1.4% |
| 7 | 26170 | 1.2% |
| 8 | 24304 | 1.1% |
| 9 | 20835 | 1.0% |
| Value | Count | Frequency (%) |
| 4174 | 1 | |
| 3800 | 1 | |
| 3728 | 1 | |
| 3706 | 1 | |
| 3658 | 1 | |
| 3486 | 1 | |
| 2955 | 1 | |
| 2899 | 1 | |
| 2860 | 1 | |
| 2796 | 1 |
ZONE1
Real number (ℝ)
Missing  Skewed  Zeros 
| Distinct | 612 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 1984327 |
| Missing (%) | 92.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.1701322 |
| Minimum | 0 |
|---|---|
| Maximum | 4792 |
| Zeros | 59935 |
| Zeros (%) | 2.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 32 |
| Maximum | 4792 |
| Range | 4792 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 41.169511 |
|---|---|
| Coefficient of variation (CV) | 5.0390264 |
| Kurtosis | 1572.6889 |
| Mean | 8.1701322 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 25.708893 |
| Sum | 1386643 |
| Variance | 1694.9287 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 59935 | 2.8% |
| 1 | 41376 | 1.9% |
| 2 | 16858 | 0.8% |
| 3 | 9264 | 0.4% |
| 4 | 6044 | 0.3% |
| 5 | 4434 | 0.2% |
| 6 | 3233 | 0.2% |
| 7 | 2573 | 0.1% |
| 8 | 2134 | 0.1% |
| 9 | 2060 | 0.1% |
| Other values (602) | 21810 | 1.0% |
| (Missing) | 1984327 |
| Value | Count | Frequency (%) |
| 0 | 59935 | |
| 1 | 41376 | |
| 2 | 16858 | 0.8% |
| 3 | 9264 | 0.4% |
| 4 | 6044 | 0.3% |
| 5 | 4434 | 0.2% |
| 6 | 3233 | 0.2% |
| 7 | 2573 | 0.1% |
| 8 | 2134 | 0.1% |
| 9 | 2060 | 0.1% |
| Value | Count | Frequency (%) |
| 4792 | 1 | |
| 2507 | 1 | |
| 1986 | 1 | |
| 1867 | 1 | |
| 1839 | 1 | |
| 1804 | 1 | |
| 1730 | 1 | |
| 1684 | 1 | |
| 1659 | 1 | |
| 1657 | 1 |
ZONE2
Real number (ℝ)
Missing  Skewed  Zeros 
| Distinct | 486 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 2017224 |
| Missing (%) | 93.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.5533094 |
| Minimum | 0 |
|---|---|
| Maximum | 3697 |
| Zeros | 40440 |
| Zeros (%) | 1.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 2 |
| Q3 | 5 |
| 95-th percentile | 29 |
| Maximum | 3697 |
| Range | 3697 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 33.487234 |
|---|---|
| Coefficient of variation (CV) | 4.4334519 |
| Kurtosis | 2107.0549 |
| Mean | 7.5533094 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 30.885189 |
| Sum | 1033474 |
| Variance | 1121.3948 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 40440 | 1.9% |
| 1 | 26941 | 1.3% |
| 2 | 15428 | 0.7% |
| 3 | 9857 | 0.5% |
| 4 | 7393 | 0.3% |
| 5 | 4836 | 0.2% |
| 6 | 3723 | 0.2% |
| 7 | 3231 | 0.1% |
| 8 | 2360 | 0.1% |
| 9 | 2074 | 0.1% |
| Other values (476) | 20541 | 1.0% |
| (Missing) | 2017224 |
| Value | Count | Frequency (%) |
| 0 | 40440 | |
| 1 | 26941 | |
| 2 | 15428 | 0.7% |
| 3 | 9857 | 0.5% |
| 4 | 7393 | 0.3% |
| 5 | 4836 | 0.2% |
| 6 | 3723 | 0.2% |
| 7 | 3231 | 0.1% |
| 8 | 2360 | 0.1% |
| 9 | 2074 | 0.1% |
| Value | Count | Frequency (%) |
| 3697 | 1 | |
| 3143 | 1 | |
| 2008 | 1 | |
| 1796 | 1 | |
| 1618 | 1 | |
| 1351 | 1 | |
| 1346 | 1 | |
| 1324 | 1 | |
| 1321 | 1 | |
| 1316 | 1 |
| Value | Count | Frequency (%) |
| False | 2154048 |
REGULARITY
Real number (ℝ)
High correlation 
| Distinct | 62 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28.042505 |
| Minimum | 1 |
|---|---|
| Maximum | 62 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 6 |
| median | 24 |
| Q3 | 51 |
| 95-th percentile | 62 |
| Maximum | 62 |
| Range | 61 |
| Interquartile range (IQR) | 45 |
Descriptive statistics
| Standard deviation | 22.286857 |
|---|---|
| Coefficient of variation (CV) | 0.79475271 |
| Kurtosis | -1.4871698 |
| Mean | 28.042505 |
| Median Absolute Deviation (MAD) | 20 |
| Skewness | 0.24740754 |
| Sum | 60404902 |
| Variance | 496.70399 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 195162 | 9.1% |
| 62 | 166477 | 7.7% |
| 2 | 118915 | 5.5% |
| 3 | 86027 | 4.0% |
| 4 | 68335 | 3.2% |
| 61 | 64431 | 3.0% |
| 5 | 56823 | 2.6% |
| 6 | 49771 | 2.3% |
| 60 | 47515 | 2.2% |
| 7 | 44483 | 2.1% |
| Other values (52) | 1256109 |
| Value | Count | Frequency (%) |
| 1 | 195162 | |
| 2 | 118915 | |
| 3 | 86027 | |
| 4 | 68335 | 3.2% |
| 5 | 56823 | 2.6% |
| 6 | 49771 | 2.3% |
| 7 | 44483 | 2.1% |
| 8 | 41208 | 1.9% |
| 9 | 37397 | 1.7% |
| 10 | 34883 | 1.6% |
| Value | Count | Frequency (%) |
| 62 | 166477 | |
| 61 | 64431 | 3.0% |
| 60 | 47515 | 2.2% |
| 59 | 39821 | 1.8% |
| 58 | 34710 | 1.6% |
| 57 | 31831 | 1.5% |
| 56 | 29166 | 1.4% |
| 55 | 27491 | 1.3% |
| 54 | 26417 | 1.2% |
| 53 | 25147 | 1.2% |
TOP_PACK
Text
Missing 
| Distinct | 140 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 902594 |
| Missing (%) | 41.9% |
| Memory size | 123.2 MiB |
Length
| Max length | 49 |
|---|---|
| Median length | 42 |
| Mean length | 23.185356 |
| Min length | 7 |
Unique
| Unique | 27 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | On net 200F=Unlimited _call24H |
|---|---|
| 2nd row | On-net 1000F=10MilF;10d |
| 3rd row | Data:1000F=5GB,7d |
| 4th row | Mixt 250F=Unlimited_call24H |
| 5th row | MIXT:500F= 2500F on net _2500F off net;2d |
| Value | Count | Frequency (%) |
| all-net | 387647 | 12.4% |
| 500f=2000f;5d | 317802 | 10.2% |
| net | 257671 | 8.3% |
| on | 238197 | 7.6% |
| 200f=unlimited | 152295 | 4.9% |
| call24h | 152295 | 4.9% |
| 2500f | 128824 | 4.1% |
| data | 127980 | 4.1% |
| data:490f=1gb,7d | 115180 | 3.7% |
| mixt | 91930 | 2.9% |
| Other values (186) | 1152864 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4097436 | 14.1% |
| 1871231 | 6.4% | |
| l | 1758228 | 6.1% |
| F | 1688162 | 5.8% |
| t | 1615813 | 5.6% |
| n | 1458987 | 5.0% |
| 2 | 1358477 | 4.7% |
| e | 1172573 | 4.0% |
| a | 1113680 | 3.8% |
| 5 | 1106572 | 3.8% |
| Other values (61) | 11774247 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 29015406 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 4097436 | 14.1% |
| 1871231 | 6.4% | |
| l | 1758228 | 6.1% |
| F | 1688162 | 5.8% |
| t | 1615813 | 5.6% |
| n | 1458987 | 5.0% |
| 2 | 1358477 | 4.7% |
| e | 1172573 | 4.0% |
| a | 1113680 | 3.8% |
| 5 | 1106572 | 3.8% |
| Other values (61) | 11774247 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 29015406 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 4097436 | 14.1% |
| 1871231 | 6.4% | |
| l | 1758228 | 6.1% |
| F | 1688162 | 5.8% |
| t | 1615813 | 5.6% |
| n | 1458987 | 5.0% |
| 2 | 1358477 | 4.7% |
| e | 1172573 | 4.0% |
| a | 1113680 | 3.8% |
| 5 | 1106572 | 3.8% |
| Other values (61) | 11774247 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 29015406 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 4097436 | 14.1% |
| 1871231 | 6.4% | |
| l | 1758228 | 6.1% |
| F | 1688162 | 5.8% |
| t | 1615813 | 5.6% |
| n | 1458987 | 5.0% |
| 2 | 1358477 | 4.7% |
| e | 1172573 | 4.0% |
| a | 1113680 | 3.8% |
| 5 | 1106572 | 3.8% |
| Other values (61) | 11774247 |
FREQ_TOP_PACK
Real number (ℝ)
High correlation  Missing 
| Distinct | 245 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 902594 |
| Missing (%) | 41.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.2724615 |
| Minimum | 1 |
|---|---|
| Maximum | 713 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 5 |
| Q3 | 12 |
| 95-th percentile | 33 |
| Maximum | 713 |
| Range | 712 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 12.280443 |
|---|---|
| Coefficient of variation (CV) | 1.3243995 |
| Kurtosis | 61.726468 |
| Mean | 9.2724615 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 4.1120661 |
| Sum | 11604059 |
| Variance | 150.80928 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 251882 | 11.7% |
| 2 | 155396 | 7.2% |
| 3 | 116447 | 5.4% |
| 4 | 85552 | 4.0% |
| 5 | 68531 | 3.2% |
| 6 | 57092 | 2.7% |
| 7 | 49478 | 2.3% |
| 8 | 43188 | 2.0% |
| 9 | 38731 | 1.8% |
| 10 | 34641 | 1.6% |
| Other values (235) | 350516 | 16.3% |
| (Missing) | 902594 |
| Value | Count | Frequency (%) |
| 1 | 251882 | |
| 2 | 155396 | |
| 3 | 116447 | |
| 4 | 85552 | 4.0% |
| 5 | 68531 | 3.2% |
| 6 | 57092 | 2.7% |
| 7 | 49478 | 2.3% |
| 8 | 43188 | 2.0% |
| 9 | 38731 | 1.8% |
| 10 | 34641 | 1.6% |
| Value | Count | Frequency (%) |
| 713 | 1 | |
| 629 | 1 | |
| 624 | 1 | |
| 612 | 1 | |
| 592 | 1 | |
| 560 | 1 | |
| 544 | 1 | |
| 511 | 1 | |
| 452 | 1 | |
| 433 | 1 |
CHURN
Categorical
High correlation 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 119.1 MiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 1750062 | |
| 1 | 403986 | 18.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 1750062 | |
| 1 | 403986 | 18.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1750062 | |
| 1 | 403986 | 18.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2154048 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1750062 | |
| 1 | 403986 | 18.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2154048 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1750062 | |
| 1 | 403986 | 18.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2154048 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 1750062 | |
| 1 | 403986 | 18.8% |
Interactions
Correlations
| ARPU_SEGMENT | CHURN | DATA_VOLUME | FREQUENCE | FREQUENCE_RECH | FREQ_TOP_PACK | MONTANT | ON_NET | ORANGE | REGION | REGULARITY | REVENUE | TENURE | TIGO | ZONE1 | ZONE2 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ARPU_SEGMENT | 1.000 | 0.008 | 0.389 | 0.880 | 0.879 | 0.817 | 0.987 | 0.519 | 0.679 | 0.009 | 0.716 | 1.000 | 0.005 | 0.453 | 0.219 | 0.311 |
| CHURN | 0.008 | 1.000 | 0.001 | 0.148 | 0.108 | 0.011 | 0.009 | 0.017 | 0.007 | 0.034 | 0.557 | 0.008 | 0.050 | 0.007 | 0.005 | 0.009 |
| DATA_VOLUME | 0.389 | 0.001 | 1.000 | 0.331 | 0.296 | 0.229 | 0.379 | -0.098 | -0.021 | 0.004 | 0.302 | 0.389 | 0.017 | -0.014 | -0.022 | -0.000 |
| FREQUENCE | 0.880 | 0.148 | 0.331 | 1.000 | 0.951 | 0.867 | 0.871 | 0.438 | 0.528 | 0.053 | 0.691 | 0.880 | 0.005 | 0.335 | 0.083 | 0.194 |
| FREQUENCE_RECH | 0.879 | 0.108 | 0.296 | 0.951 | 1.000 | 0.894 | 0.887 | 0.476 | 0.562 | 0.046 | 0.678 | 0.879 | 0.004 | 0.362 | 0.088 | 0.185 |
| FREQ_TOP_PACK | 0.817 | 0.011 | 0.229 | 0.867 | 0.894 | 1.000 | 0.812 | 0.436 | 0.536 | 0.010 | 0.597 | 0.817 | 0.000 | 0.350 | 0.098 | 0.065 |
| MONTANT | 0.987 | 0.009 | 0.379 | 0.871 | 0.887 | 0.812 | 1.000 | 0.509 | 0.669 | 0.011 | 0.707 | 0.987 | 0.006 | 0.449 | 0.215 | 0.309 |
| ON_NET | 0.519 | 0.017 | -0.098 | 0.438 | 0.476 | 0.436 | 0.509 | 1.000 | 0.551 | 0.008 | 0.523 | 0.519 | 0.000 | 0.368 | 0.065 | -0.023 |
| ORANGE | 0.679 | 0.007 | -0.021 | 0.528 | 0.562 | 0.536 | 0.669 | 0.551 | 1.000 | 0.008 | 0.457 | 0.678 | 0.015 | 0.471 | 0.125 | 0.049 |
| REGION | 0.009 | 0.034 | 0.004 | 0.053 | 0.046 | 0.010 | 0.011 | 0.008 | 0.008 | 1.000 | 0.036 | 0.009 | 0.023 | 0.007 | 0.005 | 0.000 |
| REGULARITY | 0.716 | 0.557 | 0.302 | 0.691 | 0.678 | 0.597 | 0.707 | 0.523 | 0.457 | 0.036 | 1.000 | 0.716 | 0.016 | 0.323 | 0.054 | 0.043 |
| REVENUE | 1.000 | 0.008 | 0.389 | 0.880 | 0.879 | 0.817 | 0.987 | 0.519 | 0.678 | 0.009 | 0.716 | 1.000 | 0.005 | 0.453 | 0.219 | 0.311 |
| TENURE | 0.005 | 0.050 | 0.017 | 0.005 | 0.004 | 0.000 | 0.006 | 0.000 | 0.015 | 0.023 | 0.016 | 0.005 | 1.000 | 0.000 | 0.004 | 0.009 |
| TIGO | 0.453 | 0.007 | -0.014 | 0.335 | 0.362 | 0.350 | 0.449 | 0.368 | 0.471 | 0.007 | 0.323 | 0.453 | 0.000 | 1.000 | 0.077 | 0.021 |
| ZONE1 | 0.219 | 0.005 | -0.022 | 0.083 | 0.088 | 0.098 | 0.215 | 0.065 | 0.125 | 0.005 | 0.054 | 0.219 | 0.004 | 0.077 | 1.000 | 0.107 |
| ZONE2 | 0.311 | 0.009 | -0.000 | 0.194 | 0.185 | 0.065 | 0.309 | -0.023 | 0.049 | 0.000 | 0.043 | 0.311 | 0.009 | 0.021 | 0.107 | 1.000 |